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jmO  QUALITY  UMPiCTBp  ■ 

1.  Introduction. 

The  Navy  and  the  Marine  Corps  have  been  continually 
concerned  about  the  antivehicle  and  antlshlp  mines.  The 
development  of  effective  mlneflslds  detection  procedures  are  of 
great  Importance  as  they  will  enhance  the  ability  of  the  Navy  and 
the  Marine  Corps  to  perform  their  tasks.  One  approach  that  haa 
bean  recently  studied  by  the  scientists  of  the  Navy  Is  the  use  of 
tests  for  randomness  (Mulse  and  Smith  1992} .  In  this  study  they 
express  the  need  to  develop  detection  methods  that  are  based  on 
two-dimensional  processes  that  incorporate  the  dependence 
structure  of  the  nearby  observations.  The  research  supported  by 
this  contract  addresses  this  need. 


In  Section  2  four  research  projects  will  be  described.  One 
raaearch  project  was  completed  and  the  article  based  on  It 
accepted  f(3r  publication  in  a  leading  journal  of  statistics, 
Journal  of  the  American  Statistical  Association,  the  second 
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project  Is  close  to  completion  emS  will  be  submitted  for 
publication  by  September  1.  The  other  two  project  are  at  an  early 
stajjfe,  but  with  a  clear  plan  of  solx;lntj  the  problems.  Two 
graduate  students  have  been  partially  supported  under  this 
contraci'  t(j  work  on  the  four  research  projects  referred  shove. 
Also,  I  am  iilannlng  to  be  In  touch  with  the  Navy  scientists 
Interested  in  the  minefield  detection  proV>lem. 

2.  Reaaaroh  Projacta. 

Let  X,,  ....,X„he  n  observations  In  a  two  dimensional 
rectangular  region  R  »  [0,L|]x[0,Lj] .  We  are  interested  In 
et\iidylng  statistical  procedures  that  are  effective  In  detecting 
patterns  or  chistera  of  points.  In  what  follows  the  four  research 
projects  mentioned  above  are  described  In  detail. 

(1)  Slmultanaoua  Confidence  Intarvala  for  Multinomial  Proportiona. 

Assume  that  the  rectangular  field  is  divided  as  follows  into 
two  dimensional  rectangular  subregions:  let  h^  >*  lal,2, 

where  n.  are  given  positive  integers.  For  1  i  1  i  n,  and 
1  1  J  i  iij  Define  the  random  variables 

Y^j  w  the  number  of  X  j's  in  I  ( 1-1  )hi,  lh^]x[  (  j~l  )hj,  jhj] . 


If  aiS9um«  complete  spetdel  randomneee  (oar),  l.e.  that  the  n 
points  are  generated  by  a  homogeneous  Poleeon  process  in  the  two 
dimensional  8i>ace,  then  conditional  on  the  occurrence  of  the  n 
points  Jn  t)>e  region  R,  (in  the  time  Interval  (0,t])  Y^j, 

1  <  1  ,<  and  l  J  j  <  n,,  have  a  multinomial  distribution 
with  parametera  n  and  p,-,  »  l/(n,n;f).  If  the  exact  location  of  the 
points  is  unknown  and  the  rectangular  subregions  defined  above 
are  of  a  considerable  sijse,  then  one  might  be  interested  to 
examine  the  assumption  of  cer  V»asod  on  the  observed  data  Y^.. 
Assuming  that  the  Yjj  are  diatrlhuted  according  to  a  multinomial 
distribution,  but  possibly  with  not  equal  p^’s,  wo  can  construct 
a  confidence  region  for  the  Pfj'a  with  a  specified  probability  of 
coverage  and  see  If  the  vector  ( l/(n ,  .  .  .  .  ,  l/(n,nj))  is  in 
that  region.  Now,  If  car  is  not  warranted,  then  we  might  ))e 
willing  to  assume  that  the  Y^j  are  distributed  according  to  a 
multinomial  distribution,  with  distinct  pjj's.  In  that  case  too 
we  are  Interested  to  estimate  the  pij'a  based  on  the  observed 
data . 

The  enclosed  article  "Slmultancus  Confidence  Intervals  and 
Sample  Size  Determination  for  Multinomial  Proportions"  studies 
this  problem  that  has  been  studied  by  many  researchers.  Using  a 
parametric  bootstrap  approach  along  with  Edgeworth  expansion  for 
approximating  the  rectangular  multinomial  probabilities  accurate 
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confidence  regions  are  conei:ructe<J  for  multinomial  proportions. 
Ptlllzlng  this  method  we  present  an  algorithm  for  determine  the 
sample  size  neede<5  to  construct  a  conf1de)\ce  region  with  a 
specified  vrolume  and  pi'oViabl .1  Ity  of  coverage. 

(11)  Parametric  Bootstrap  Inference  Based  on  Extreme  Order 
Statistics  In  a  Multlnlomlal  Experiment. 

In  this  project  we  have  the  same  net  up  as  in  (1). 

We  assuming  that  th*'*  V.-  are  distributed  according  to  a 

multinomial  distribution,  but  possibly  with  not  equal  p^i's.  Here 

'  ¥ 

testing  the  null  hypotheses  of  car  is  equivalent  to  testing  that 
the  p^j's  are  equal.  Me  propose  to  study  the  performance  of  the 
test  statistic  based  on  max{Y^j)  and  mln(y^.j),  i.e.  reject  If 
max(Y,;j}  i  a  or  mln(Y,|j)  i  b.  The  approximation  for  multinomial 
rectangular  prob-abllltles  derived  In  project  (1)  (Slson  and  Glaz 
1994)  will  be  utilized  In  studying  the  performance  of  this  test 
statistic.  It  will  enable  ue  to  evaluate  p-values  and  the  power 
of  the  test.  Moreover,  confidence  intervals  for  the  largest  and 
smallsst  multinomial  proportion  have  been  also  studied.  This 
project  will  be  completed  by  deptember  i. 
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(111)  Th«  Two  dlmanalonal  Ratchat  Scan  Statistic 

Iiet  .  .  .  .  jXj,  be  n  obaer vat. Ions  In  f-x  two  diwsriHlona.l 
rectangular  reg.te>n  R  »  [0,L,]xCO,L*] .  Let  h,-  **  L^/n^,  where 

n.;  are  given  positive  Integers.  For  1  i  1  .<  n^  and  1  i  j  l  Uj 
Def.lne  the  random  variables 

Yj|  ■  the  number  of  X|’s  in  [  { 1-1  )h ,,  Ih |]xt  ( j-l ) b Jlij] . 

If  we  assume  complete  spmt.lm.l  t'ttndcmneps  (car),  i.e.  that  the  n 
points  are  ge>\erate<l  by  s  homogeneoue  Poisson  process  In  the  two 
dlnu!‘ne,Uina,l  apace,  then  conditional  on  the  occurrence  of  the  n 
pointe  in  the  region  R,  (In  the  time  interval  (0,t3)  Y^j,  1  1  i 

n|  and  1  i  J  i  n2,  have  a  multinomial  distribution  with 
parameters  n  and  p.j  ■  l/(n,ny).  If  we  do  not  know  the  exact 
location  of  occurrence  of  the  n  observations  and  the  rectangular 
regions  are  small  we  might  be  interested  In  examining  the  number 
of  event  that  occur  In  several  adjacent  eubreglons.  For 
1  i  1^1  n^-m,+  l  and  1  i  J^  i  n2-m2+.l  we  define 

1  2“  1  1  ^+m  1“  1 

W{l,,l2)  -  r  r  Y^j. 

J«lj  1-1, 
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W{l,,lj)  i»  the  number  of  ohaervation«  in  e  reotengular  refllon 
thet  le  comprised  from  m^nj  adjacent  subrefllcns.  If  W(.t,,i2) 
exceeds  the  value  k<  then  k  points  are  clustered  within  thie 
reotrtnQular  region.  The  two  <51men«lonai  ratchet  scan  statistic  is 
defined  to  be 

W,„  ■  max  {W(li,ij);  11 1  ,.<in,-m,+  l ,  li  1  ;^n j-m^t  J }  < 

where  m  ••  (m.<mj)'.  (The  one  dimensional  ratcdiet  scan  statistic 
has  been  recently  studied  by  Krauth  1592).  We  have  started 
stti, dying  approximations  for  the  i1  Istr lljutlon  of  W,|,.  The  methods 
i-liat  we  are  using  Include  probability  Ineiiuallties  (Glaiz  1993) 
and  compound  Poisson  approximations  (Gla*.  Waus,  Wallenstein  and 
Ruos  1994). 

For  lii,in,“m^+l  and  lii;in^-my+3  define  the  indicator  random 
variables 


T(i,,i,)  -  1,  if  W(li,ij)  1  k 
•  0,  Otherwise. 


Tlie  two  dJmmwJonn.l  ratchet  scan  atatJatJr  is  defined  as 


Ij+m--?  l^+m,-! 

i;  »  r  Z  r(i„lj). 

J-Jj  1-1. , 
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Tt  comitiHi  tli«  numtaer  of  )‘i»ol':a)^<ju3ajr  rt^gionn  gJve»i  by 
[  1 I  .+m <“1  ]xt V  ^ ^  that  contain  k:  or  more  observat  ions. 

Even  An  the  one  dlmenslcmal  case,  there  are  no  acovtrate 

Ions  for  the  dlatrlbut Jon  of  f, .  In  this  case  we  ere 
plnnrilrig  to  employ  a  rompcvund  Polsnon  approx! inn t loow  (Kooh 
nir*/.  'Iaua  ,  RotjiM  anti  M**].  1  ensteln  We  win  attempt  to 

general  t'/.H  tliese  resnllM  .In  the  case  where  the  ha\/e  a 

irmlttnomJal  d  1  etr J but J on  but  with  different  Pn'e. 

■ 

(Iv)  Two  Dlmanaiona.!.  Dlsorata  Soan  Statistic 

In  this  project  we  have  the  seme  aet  up  ue  In  (HI),  only 
here  we  aseume  that  the  Vij's  are  Independent  Integer  valued 
rendom  variables.  The  binomial  and  the  Poleeon  models  are 
,inr.luc]ecl  In  this  stvtdy.  For  the  speolal  case  of  he.lng  1lr1 

Bernoulli  trials  the  strong  laws  and  the  extreme  value  asymptotic 
distt  Abut  Aon  of  W,,  has  been  etud.led  in  Dar.llng  and  Waterman  (1986 
and  1986). 

Under  the  null  hypotheses  of  randomness  we  wA.ll  assume  that 
t)ie  Y|;'s  are  Aid  according  to  a  specified  discrete  distribution, 
for  example  the  PoAseon  distribution  with  parameter  X j.  Under  the 
alternative  of  clustering  we  aseume  that  siamewhere  In  the 
rectangular  region  there  exists  a  suhreglctn  [  1  A  ,+m  ,“.1  ]xt  :l  i  j+m«-l  ] 
so  that  the  distribution  of  the  number  of  points 


T 
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In  that  nuhreu.lon  Is  Poiaaon  with  maaii  where  >  x  q,  Then 

the  nu], ]  hypotheeltt  of  randomneae  1«  rejected  at  the  a  level  If 
W,j  >  k,  where  a  «  P(Wn  ^  k).  To  Implement  this  clueter  detection 
procedure,  accurate  approximation®  for  the  diet rlhutlon  of  the 
two  dimeuHlonal  dlecrete  scan  etatlatlc  have  to  be  derived. 

Firsts  a  product-type  approximation  elmllai'  to  tlie  one  for  the 
one  dimensional  case  in  Qlaz  and  Naus  (1991,  S.28}  will  he 
derived.  Moreover,  Inequalities  for  the  tall  prohabl 1 1  tie®  of  the 
two  dlinenHj(2nal  discrete  scan  statistic  will  be  stu(!jed.  Tills 
will  extend  the  result®  In  Glax  and  Nsu®  (1991,  Theorem  1).  In 
tile  one  dimensional  case  the  bounds  were  asymptotloaii  y  1 1qiit .  Tt 
will  Intereetlng  to  Investigate  the  asymptotic  properties  of  the 
lueqtialltles  In  the  two  dimensional  case. 

Another  approach  to  approximate  the  distribution  of  W,^  that 
will  oe  studied  is  the  Poisson  and  compound  Poisson 
approximation®  using  the  Chen-Steln  method  (Arratia,  Goldstein 
and  Gordon  1989  and  1990,  Barbour,  Chen,  and  Loh  1991,  Barbour, 
Holst  and  Jan^ion  1992,  Rooe  1993,  Glaz,  Naus,  Wallenstein  and 
Roos  1994)).  These  approximations  will  be  of  great  Importance  in 
investigating  the  distribution  of  the  multiple  discrete  acan 
statistic  that  was  discussed  in  detail  In  (Hi)  for  the  ratcJiet 
scan  statistic. 
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